@mkultraWasHere (Contributor) commented Dec 17, 2025

Summary

  • Added TaskOutputBuffer toolset providing a stateful output buffer for accumulating task results
  • Agents can incrementally build up output across multiple steps using output() and retrieve results with get_output()

Impetus: certain evals now require the model to explicitly mark the output that determines its score, rather than having scorers check any output produced anywhere in the trajectory. With this tool, all marked output is stored on the toolset and retrieved later by the eval scorers for verification.
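
To make the flow above concrete, here is a minimal, standalone sketch of the buffer-then-score pattern. It is not the merged implementation: the actual class is described as a toolset that uses `PrivateAttr` and tool methods, whereas `OutputBufferSketch` and `score_marked_output` are hypothetical names built on a plain dataclass, and the return types of `output()` and `get_output()` are assumptions based on the summary text.

```python
from dataclasses import dataclass, field


@dataclass
class OutputBufferSketch:
    """Hypothetical stand-in for TaskOutputBuffer; not the merged implementation."""

    # Per-instance list of everything the agent has explicitly marked as output.
    _outputs: list[str] = field(default_factory=list, repr=False)

    def output(self, content: str) -> int:
        """Append content to the buffer and return the total number of outputs."""
        self._outputs.append(content)
        return len(self._outputs)

    def get_output(self) -> list[str]:
        """Return a copy of all saved outputs, in insertion order."""
        return list(self._outputs)


def score_marked_output(buffer: OutputBufferSketch, expected: str) -> float:
    """Hypothetical scorer: 1.0 if any marked output contains `expected`, else 0.0."""
    return 1.0 if any(expected in item for item in buffer.get_output()) else 0.0


# The agent marks output across several steps; the scorer reads only the buffer.
buffer = OutputBufferSketch()
buffer.output("intermediate note")
count = buffer.output("final answer: 42")
score = score_marked_output(buffer, "42")
```

The point of the design is that the scorer inspects only what the model chose to mark, not the full trajectory, so incidental text elsewhere in the run cannot affect the score.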


Generated Summary:

  • Added TaskOutputBuffer class for stateful output management, allowing agents to accumulate results incrementally.
  • Introduced two methods within TaskOutputBuffer:
    • output: Adds content to an internal buffer and returns the total count of outputs.
    • get_output: Retrieves all previously saved outputs in order.
  • Updated docstrings for clarity and adherence to markdown formatting.
  • Imported PrivateAttr from pydantic for managing internal attributes securely.
  • Enhanced error handling by integrating catch=True in tool methods, improving robustness during execution.
  • Overall, these changes improve task management and output reporting capabilities within the agent framework.

This summary was generated with ❤️ by rigging
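
The `catch=True` setting mentioned in the generated summary is not explained in this thread. One plausible reading, stated here purely as an assumption, is that exceptions raised inside a tool method are converted into a message returned to the model instead of aborting the run. The sketch below illustrates that idea with a hypothetical `catching` decorator and a hypothetical `save_output` function; neither is part of the library's API.

```python
import functools
from typing import Callable


def catching(func: Callable[..., str]) -> Callable[..., str]:
    """Hypothetical decorator approximating the assumed effect of catch=True."""

    @functools.wraps(func)
    def wrapper(*args, **kwargs) -> str:
        try:
            return func(*args, **kwargs)
        except Exception as exc:
            # Report the failure as text instead of letting it propagate.
            return f"Error: {type(exc).__name__}: {exc}"

    return wrapper


@catching
def save_output(buffer: list[str], content: str) -> str:
    """Hypothetical tool body that rejects empty content."""
    if not content.strip():
        raise ValueError("content must not be empty")
    buffer.append(content)
    return f"Saved output #{len(buffer)}"


saved: list[str] = []
ok_result = save_output(saved, "first result")
error_result = save_output(saved, "   ")  # returns an error string, does not raise
```

Under this reading, a bad call leaves the buffer unchanged and the agent can retry with corrected input.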

@mkultraWasHere changed the title from "additional task tool" to "feature: additional task tool" on Dec 17, 2025
@mkultraWasHere changed the title from "feature: additional task tool" to "feat: additional task tool" on Dec 17, 2025
Copilot AI (Contributor) left a comment

Pull request overview

This PR adds a new TaskOutput toolset for stateful output management and fixes markdown formatting in existing docstrings.

  • Adds a new TaskOutput class that provides an output buffer for agents to accumulate task results
  • Fixes markdown formatting in give_up_on_task docstring for better rendering
  • Imports necessary dependencies including PrivateAttr from pydantic and Toolset/tool_method from base

@mkultraWasHere added this pull request to the merge queue on Dec 18, 2025
@github-merge-queue (bot) removed this pull request from the merge queue due to failed status checks on Dec 18, 2025
@mkultraWasHere added this pull request to the merge queue on Dec 18, 2025
Merged via the queue into main with commit 4685fe8 on Dec 18, 2025
8 checks passed
@mkultraWasHere deleted the mk/task_output branch on December 18, 2025 19:47